A Comparative Study of Search Result Diversification Methods

نویسندگان

  • Wei Zheng
  • Hui Fang
چکیده

Top-ranked documents returned by traditional retrieval functions may cover the same piece of relevant information and cannot satisfy different user needs. Search result diversification solves this problem by diversifying results to cover more information needs, i.e., query subtopics, in top-ranked documents. Many diversification methods have been proposed and studied, and most of them re-rank original retrieved documents according to both relevance and diversity functions in a probabilistic framework. Although official TREC results make it possible to compare the effectiveness of different diversification systems, it remains unclear whether the better performance of a system comes from better diversification methods or component estimation methods. In this paper, we conduct a systematic study on comparing three representative diversification methods which can be implemented using probabilistic methods. We not only analytically compare the methods but also conduct empirical studies and evaluate the effectiveness of these methods in a controlled manner.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Impact of Comparative Advantage of Agricultural Triple Industries and Export Diversification on the Value-Added Industries in Iran

Developmental plans of the country emphasize on the exports-focused growth strategy, and export diversification is one of the most appropriate policies in this area. Export diversification moves from primary goods to industrial goods. Yet, export diversification, according to the principles of international trade, must be based on comparative advantage until to change value-added. Changes in th...

متن کامل

Analysis of Limitation in Rural Economy Diversification Case: Upper Ashkevar in Rudsar County

Introduction In economy structure of rural districts in different countries, agriculture is considered as the main source of livelihood. The most important characteristic of this structure is lack of diversification for economic contexts and job opportunities, especially for the increasing number of people in the villages which is almost the result of attitude toward the village and government...

متن کامل

Leveraging Dynamic Query Subtopics for Time-Aware Search Result Diversification

Search result diversification is a common technique for tackling the problem of ambiguous and multi-faceted queries by maximizing query aspects or subtopics in a result list. In some special cases, subtopics associated to such queries can be temporally ambiguous, for instance, the query US Open is more likely to be targeting the tennis open in September, and the golf tournament in June. More pr...

متن کامل

A Query Classification Scheme For Diversification

Search result diversification enables the modern day search engines to construct a result list that consists of documents that are relevant to the user query and at the same time, diverse enough to meet the diverse user expectations. However, all the queries received by a search engine may not benefit from diversification. Further, different types of queries may benefit from different diversifi...

متن کامل

Evaluating subtopic retrieval methods: Clustering versus diversification of search results

To address the inability of current ranking systems to support subtopic retrieval, two main post-processing techniques of search results have been investigated: clustering and diversification. In this paper we present a comparative study of their performance, using a set of complementary evaluation measures that can be applied to both partitions and ranked lists, and two specialized test collec...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011